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We explore statistical inference in self-similar conservative frag- 
mentation chains, when only (approximate) observations of the size 
of the fragments below a given threshold are available. This frame- 
work, introduced by Bertoin and Martinez, is motivated by mineral 

■ crushing in mining industry. 

Q\ ' The underlying estimated object is the step distribution of the 

■ random walk associated to a randomly tagged fragment that evolves 

along the genealogical tree representation of the fragmentation pro- 
cess. We compute upper and lower rates of estimation in a para- 
metric framework, and show that in the non-parametric case, the 
difficulty of the estimation is comparable to ill-posed linear inverse 
problems of order 1 in signal denoising. 



Keywords: fragmentation chains; parametric, non-parametric estimation, 
key renewal theorem. 
Mathematical Subject Classification: 60J80, 60J25,62G05, 62M05 . 
e-mail: marc.hoffmann@univ-mlv.fr, nathalie.krell@courriel.upmc.fr 



*Universite Paris-Est, Laboratoire d'Analyse et de Mathematiques Appliquees, 
CNRS-UMR 8050, 5, boulevard Descartes, 77454 Marne-la-Vallee Cedex 2, France. 

^Universite Paris 6, Laboratoire de Probabilites et Modeles Aleatoires, CNRS-UMR 
7599, 16, rue Clisson, 75013 Paris Cedex, France. 



1 



1 Introduction 



1.1 Motivation 

Random fragmentation models, commonly used in a variety of physical 
models, lay their theoretical roots in the works of Kolmogorov [llj and 
Filippov [9] (see also [H [131 E] an d the references therein). Informally, 
we imagine an object that falls apart randomly as time passes. The re- 
sulting particles break independently of each other in an independent and 
self-similar way. A thorough account on random fragmentation processes 
and chains is given in the book by Bertoin [5j, a key reference for the 
paper. 

In this work, we take the perspective of statistical inference. We focus 
on the quite specific class of self-similar fragmentation chains. The law of 
the fragmentation is then entirely determined by its dislocation measure 
and its index of self-similarity, which govern the way and the rate at which 
the fragments split. If one is allowed to observe the whole fragmentation 
process up to some fixed time, then the statistical problem is somehow 
degenerat^j]. We postulate a more realistic observation scheme, motivated 
by mining industry, where the goal is to separate metal from non valued 
components in large mineral blocks by a series of blasting, crushing and 
grinding. In this setting, one rather observes approximatively the frag- 
ments arising from an initial block of size m only when they reach a size 
smaller than some screening threshold, say r/ > 0. Asymptotics are taken 
as the ratio e := rj/m vanishes. See Bertoin and Martinez [7] and the 
references therein. 

1.2 Organization and of the paper 

In Section 2, we recall some well known facts about the construction of 
conservative fragmentation chains, following closely the book by Bertoin 
[5]. For statistical purposes, our main tool is the empirical measure S £ of 
the size of fragments when they reach a size smaller than a threshold e 
in the limit e — * 0. We highlight the fact that S £ captures information 
about the dislocation measure through the Levy measure tt of a randomly 
tagged fragment associated to the fragmentation process. 

1 in the sense that it can be mapped into relatively standard equivalent inference 
problems such as probability distribution estimation from independent observations, 
see Section \4. 41 
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In Section 3, we give a rate of convergence for the empirical measure 
£ £ toward its limit in Theorem 13 .1\ extending former results (under more 
stringent assumptions) of Bertoin and Martinez [7j. The rate is of the 
form e 1 / 2 ~^( 7r ), where £(ir) > can be made arbitrarily small under ade- 
quate exponential moment conditions for ir. We add-up the more realistic 
framework of observations with limited accuracy, where each fragment is 
actually known up to a systematical stochastic error of order a <C £■ 

In Section 3, we construct estimators related to functionals of it in 
the absolute continuous case. In the parametric case (Theorem I3.4p . we 
establish that the best achievable rate is e 1//2 in the particular case of 
binary fragmentations, where a particle splits into two blocks at each 
step exactly. We construct a convergent estimator in a general setting 
(Theorem 13 . 2[) with an error of order e 1 / 2 ~^( 7r ) ) for another £'(tt) > 
that can be made arbitrarily small under appropriate assumptions on the 
density of ir near and +oo. In the non-parametric case, we construct an 
estimator that achieves (Theorem l3.5p a rate of the form (e 1- ^ ( 7r )) s /( 2s + 3 ) i 
where s > is the local smoothness of the density of tt, up to appropriate 
rescaling. Except for the factor l"(ir) > 0, we obtain the same rate as 
for ill-posed linear inverse problems of degree 1. We suggest a simple 
interpretation of this result in terms of the asymptotic form of £ £ in the 
discussion Section 4, appended with further remarks about Theorems 13. 1| 
I3T21 and 1331 

An appendix (Section 6) recalls sharp results on the key renewal the- 
orem from Sgibnev [16] that are used to derive Theorem 13.11 

2 Statistical model 

2.1 Fragmentation chains 

Let X = (X(t),i > 0) be a fragmentation chain with state space 

oo 

S l := {s = (s 1 ,s 2 , • • •), si > s 2 > ■ • • > 0, < l}. 

i=i 

We assume that X has self-similar parameter a > 0. For well-definiteness, 
see e.g. Bertoin [5], the following mild assumptions on the dislocation 
measure v(ds) of X are in force throughout the paper: 

Assumption A. We have z/((l,0, ...)) = and v(si G (0,1)) > 0. 
Moreover, for every e > 0: j sl YliZi l{s l >e} l/ (^ s ) < °°- 
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We denote by P m the law of X started from the initial configuration 
(m, 0, ...) with m £ (0,1]. Under P m , X is a Markov process and its 
evolution can be described as follows: a fragment with size x lives for an 
exponential time with parameter x a v(S^) and then splits and gives rize 
to a family of smaller fragments distributed as x£, where £ is distributed 
according to v{*)/v(S^). Under P m , the law of X is entirely determined 
by a and v{»). 

We will repeatedly use the representation of fragmentation chains as 
random infinite marked trees. Let 

oo 

U : = \J re 

n=0 

denote the infinite genealogical tree (with N° := {0}) associated to X as 
follows: to each node u £ U, we set a mark 

ui O'Uj Cu)> 

where £ u is the size of the fragment labelled by it, a u is its birthtime and 
£ u is its lifetime. We have the following identity between point measures 
on (0, +oo): 

oo 

E 1 {x lW >o}^w = E Vk^)}^' 1 - °' 

i=l uGU 

with X(t) = (X\(t), X2(t), ■ ■ ■ ) , and where S x denotes the Dirac mass 
at x. Finally, X has the following branching property: for every fragment 
s = (si, . . .) G and every t > 0, the distribution of X(t) given X(0) = s 
is the same as the decreasing rearrangement of the terms of independent 
random sequences X^(t), XW(t),... where, for each i, X®{t) is dis- 
tributed as X(t) under P s< . 



2.2 Observation scheme 

For simplicity, we assume from now on that u(S^) = 1. Keeping in mind 
the motivation of mineral crushing, we consider the fragmentation undeil§ 
P := Pi, initiated with a unique block of size m = 1 and we observe the 

2 We often need to accomodate further random variables independent of X. Abusing 
notation slightly, we will still use the notation P without further notice, working tacitly 
on an appropriate enlargement of the original probability space. 
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process stopped at the time when all the fragments become smaller than 
some given threshold e > 0, so we have data £ u , for every u 6 IA £ , with 

U £ := {u G W, > £, C„ < e}, 

where we denote by u— the parent of the fragment labelled by u. We will 
further assume that the total mass of the fragments remains constant 
through time: 

Assumption B. ( Conservative property). We have: i/( X^Si s i = l) = 1- 
We next consider the empirical measure 

where g(») is a test function. Indeed, under Assumption B, we have 

^2 €u = 1 IP — almost surely, (2.1) 

u£U E 

so S e (g) appears as a weighted empirical version of g(»). Bertoin and 
Martinez show in [7] that under mild assumptions on the measure 
£ e (g) converges to 

1 ( ) f °° 

£{g) ■= 7T / — / V Si 1 { <a }i/(ds)da 
c(f) Jo a Jsifr! 

in probability, as e — ► 0, with c(z^) = — J* 5i s jl°g s « i'(cfe), tacitly 
assumed to be well-defined. This suggests a strategy for recovering infor- 
mation about z/(») by picking suitable test functions <?(•). 

2.3 First estimates 

From now on, we assume we have data 

X £ ■= (£„, u £ U e ) (2.2) 

and we specialize in the estimation of v{»). Clearly, the data give no 
information about the self-similar parameter a that we consider as a 
nuisance parameteiH. Assumptions A and B are in force. At this stage, 

3 See however Section \4. 41 for auxiliary results about the inference on a. 
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we can relate £{g) to a more appropriate quantity by means of the 
so-called tagged frament approach. 



The randomly tagged fragment. Let us first consider the homogenous 
case a = 0. Assume we can "tag" a point at random -according to a 
uniform distribution- on the initial fragment and imagine we can follow 
the evolution of the fragment that contains this point. 

Let us denote by > 0) the process of the size of the frag- 

ment that contains the randomly chosen point. This fragment is a typical 
observation in our data set X e , and it appears at time 

T £ :=inf{t>0, X (t)<e}- 

Bertoin [5] shows that the process ((t) := — logx(t) is a subordinator, 
with Levy measure: 

oo 

ir(dx) := e" x ^^(-logSj G dx). (2.3) 
i=i 

We can anticipate that the information we get from X e is actually in- 
formation about the Levy measure ir(dx) of throughout CC^e)- The 
dislocation measure v(ds) and ir(dx) are related by (|2.3j) which reads 

/ y>/(*iM<k)= / f(er*)n(dx), (2.4) 

•AS J- i=1 J(0,+oo) 

for any suitable /(•) : [0, 1] — ► [0, +oo). In particular, by Assumption B 
and the fact that u(S^) = 1, n(dx) is a probability measure hence is 
a compound Poisson process. Informally, a typical observation takes the 
form C(^e)j which is the value of a subordinator with Levy measure n(dx) 
at its first passage time strictly above — loge. The case a / is a bit 
more involved and reduces to the homogenous case by a time change. 

In terms of the limit of the empirical measure £ e (g), we equivalently 
have 

1 f 1 g(a) 1 f +oc 

£(g) = / 7r(— log a, +oo) da = / g(e x )ir(x, +oo) dx, 

c(vr) Jo a c(?r) J 

with c(ir) = J^ +oo ^ x ir(dx), the two representations being useful either 
way. This approach will prove technically convenient and will be detailed 
later on. Except in the binary case (a particular case of interest, see 
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Section I4.ip . the knowledge of 7r(») does not allow to recover v{») in 
general. 

Measurements with limited accuracy. It is unrealistic to assume 
that we can observe exactly the size £ u of the fragments. This becomes 
even more striking if the dislocation splits at a given time into infinitely 
many fragments of non zero size, a situation that we do not discard in 
principle. Therefore, we replace (|2.2I) by the more realistic observation 
scheme X £)(T := (£u , u G W £i<T ) with 

U e , a := {ueU, e2 >£, & ] <4> 

and 

~tu + <?U u . (2.5) 

The random variables (U u , a£W) are identically distributed, and account 
for a systematic experimental noise in the measurement of X E , indepen- 
dent of X e . We assume furthermore that, for every u €U, 

\U U \<1 and E[U U ] = 0. 

The noise level < a = a(e) <C e is assumed to be known and represents 
the accuracy level of the statistician. 

The observations £ u + aU u are further discarded below a threshold 
a < t e < e beyond which they become irrelevant, leading to the modified 
empirical measure 

£eA9)--= E l { ^>t^ i u ) 9(i { : ) /e). 
In the sequel, we take t e = 706 for some (arbitrary) < 70 < 1. 

3 Main results 

We first exhibit explicit rates in the convergence £ £ (g) —> £(g) as e — > 0, 
extending Proposition 1.12 in Bertoin0 [5]. We then turn to the estimation 
of 7r(»). 

4 See also Bertoin and Martinez [7]. 
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3.1 A rate of convergence for the empirical measure 

For k > 0, we say that a spread-outd probability measure 7r(dx) defined 
on [0, +oo) belongs to II(k) if 

/ e KX ir(dx) < +oo, 

J[0,+oo) 

appended with II(oo) := P| k>0 II(k). For m > 0, define the class of con- 
tinuous functions 

C(m) := {#(.) : [0, 1] -> R, \\g\loo := sup \g(x)\ < m}, 

X 

and C'(m) the class of continuously differentiable functions <?(•) such that 
g 1 € C{m). 

Theorem 3.1. Grant Assumptions A and B. Let < k < oo and assume 
that tt G II(k). 

• For every m > and < fi < n, we have 

sup E [(£ £ ( 5 ) - £( 5 )) 2 ] = ( £ ^ +1 )). ( 3.i) 

9 eC(m) 

• 77ie convergence (|3. 1 j) remains valid if we replace S e (») by £ £)(T (») 
and C{m) byC'(m), up to an additional error term: 

sup E [(£ £ja (g) - £ £ (g)) 2 } = 0{a 2 e~ 2 ). (3.2) 
<?ec'(m) 

3.2 Statistical estimation 

We study the estimation of 7r(») by constructing estimators based on 
£ e (») or rather £ £)(T (»). We need the following regularity assumption: 

Assumption C. The probability measure ir(dx) is absolutely continuous. 

We denote by x ~^ ir(x) its density function. We distinguish two cases: 
the parametric case, where we estimate a linear functional of 7r(») of the 
form 

r+oo 

m k{j^) '■= / x k 7r(x)dx, for some k > 1, 
5 We recall some properties on spread-out measures in the Appendix. 
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and the non-parametric case, where we estimate the function x ~» ir(x) 
pointwise. In that latter case, it will prove convenient to assess the lo- 
cal smoothness properties of 7r(») on a logarithmic scale. Henceforth, we 
consider the mapping 

a ~> (3(a) := a _1 7r(- log a), a 6 (0,1). (3.3) 

In the non-parametric case, we estimate (5(a) for every a £ (0, 1). 



3.3 The parametric case 



For k > 1, we estimate 

r+oa rl 

m k{n) ■= / x k ir(x)dx= / \og(l / a) k (3(a)da 
Jo Jo 

by the correspondence (|3.3p and implicitly assumed to be well-defined. 
We first focus on the case k = 1. Pick a sufficiently smooth test function 
/(.) : [0, 1] ->■ M such that /(l) = and let g(a) := -af'(a). Plainly 

1 f 1 9(a) 



£(g) = [ - — 7r(— log a, +oo) da 
c(vr) J a 



f'(a) / (3(u)duda = -— / f(a)(3(a)da. 



mi(ii) J Jo mi( 

Formally, taking /(•) = 1 would identify l/mi(ir) since (3(») integrates to 
one, but this choice is forbidden by the boundary condition f(l) = 0. We 
then consider instead the following approximation. Let / 7 (») : [0, 1] — ► R 
with < 7 < 1 be a smooth function such that 

• fry (a) = 1 for a < 1 — 7 and / 7 (1) = 0. 

• II/7II00 = 1 and ||/ 7 ||oo < C7" 1 , for some c > 0, 

a choice which is obviously possible. For a parametrization 7 := 7 e — > 0, 
we set gry £ (a) := — afL(a) and define 

1 

£e,<x(#7e) 

More generally, for > 1, we define successive moment estimators as 
follows. Set hry E (a) := / 7e (l — a)log(l/a) fc and 5 7e (a) := —ah' (a). Let 



™M : = 7^-^- ( 3 - 4 ) 



£e,a(gry £ ) 

m k , £ := — -i r-. 

^(f7ej 
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We can describe the performances of rhk,e under an additional decay 
condition of 7r(») near the origin. For k > 0, that the probability 

7r(») belongs to the class 1Z(k) if 

limsup x~ K+l n{x) < +00 

appended with 7£(oo) := n K >o^( K )- We obtain the following upper 
bound: 

Theorem 3.2. Grant Assumptions A, B and C. Let < k±, K2 < 00 with 
K\ > max{l, ^2}- 

For 1 < fi < Ki, let m fej£ be specified by j £ := eA*/(a*+1)(2k2+i) _ 
family 

£ -, K2 /(M + l)(2 K2+ l) (afci£ _ mfc(7r)) 

is tight under Pi as soon as 

it g ii(ki) n^(K 2 ) 

and as -3 remains bounded. 

Some remarks: the convergence of m/% i£ to mfc(-7r) is of course no sur- 
prise by (|3.ip . However, the dependence in e in the test function g e (») (in 
particular g e (») is unbounded as e — > 0) requires a slight improvement of 
Theorem 13. 11 This can be done thanks to Assumption C, see Proposition 
15.21 in the proof Section [5.21 The requirement ere -3 = 0(1) ensures that 
the additional term coming from the approximation of £ e (») by £ a , £ {») is 
negligible. This condition is probably not optimal, see Section 4. 

Our next result shows that the exponent [1K2I (/i + 1)(2k2 + 1) < 1/2 
in the rate of convergence is nearly optimal, to within an arbitrarily small 
polynomial order. 

Definition 3.3. Let 7i"o(») satisfy the assumptions of Theorem \3.2\ The 
rate < v £ — > is a lower rate of convergence for estimating m^^o) if 
there exists a family ir e (») satisfying the assumptions of Theorem \3.S\ and 
a constant c > such that 

liminf inf max P [tj" 1 ^ — 777^(71") | > c] > 0, (3-5) 

where the infimum is taken (for every e) over all estimators constructed 
with X £jCT at stage e. 

6 In the notation, we identify the probability measure -ir(dx) and its density function 
x ^> n{x) when no confusion is possible. 
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Definition 13,31 expresses a kind of local min-max information bound: 
given 7To(»), one can find an opponent 7r £ (») such that no estimator can 
discriminate between 7To(») and 7r e (») at a rate faster than v e . 

We further restrict our attention to binary fragmentations, see Section 
14.11 In that case, the dislocation measure satisfies v{s\ + S2 ^ 1) = 0, and, 
because of the conservation Assumption B, can be represented as 



where p{») is a probability measure on [1/2, 1]. 

Assumption D. (Binary case.) The probability measure /?(•) associated 
to tt(») is absolutely continuous and its density function is bounded away 
from zero. 

Theorem 3.4. Assume that the fragmentation is binary and grant As- 
sumption D. In the same setting as in Theorem \3.S\ the rate e 1 ^ 2 is a 
lower rate of convergence for estimating mk{%). 

The restriction to the binary case is made for technical reason and is 
inessential. Theorem 13.41 presumably holds in a more general setting. 

3.4 The non-parametric case 

Under local smoothness assumptions on the parameter /?(•), we estimate 
/3(a) for every a G (0,1). Given s > 0, we say that /?(•) belongs to the 
Holder class S(s) if there exists a constant c > such that 



where s = n + {s}, with n a non-negative integer and {s} G (0, 1]. We also 
need to relate /?(•) to the decay of its corresponding Levy measure 7r(»). 
Abusing again notation, we identify H(k) with the set of /?(•) such that 
e x (5{e~ x )dx G thanks to the inverse of (j3.3|) . Likewise for 1Z(k). 

We construct an estimator of /?(•) as follows: for a G (0, 1) and a 
normalizing factor < j £ — > 0, set 



where (p(») is a smooth function with support in (0, 1) that satisfies the 
following oscillating property: for some integer N > 1, 



v(ds) = p(ds 1 )5i- Sl (ds2) 



(3.6) 



(3 {n) (y) -(3 {n \x)\ <c\y-x 



¥> 7e> a(») := le V((» - a)he) 




(3.7) 
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Our estimator then takes the form 

%{a) :=mi |E f £)(T (-.^ ei(! (.)) aG (0,1), 

where rhi e is the estimator of mi{ir) defined in (|3.4p . We then have the 
following 

Theorem 3.5. Grant Assumptions A, B and C. Let 1 < K\,K2 < oo. 

Forl<fi< min{l,Ki/2}, let%(») be specified by-y E : = £ /V(m+i)(2s+3) _ 
For every a G (0, 1), the family 

£ -W(M+l)(2 S +3)^ (a) _ /3(a) ) 

is tight under Pi , as soon as 

(3e s(s)nn(Ki)n% 2 ) 

for s < maxjiV, K2 — 1} and ae~ 3 remains bounded. 

A proof of the (near)-optimality in the sense of the lower bound Defini- 
tion [3]3] and in the spirit of Theorem [33] is presumably a delicate problem 
that lies beyond the scope of the paper. More in Section FOl 

4 Discussion 

4.1 Binary fragmentations 

The case of binary fragmentations is the simplest, yet an important model 
of random fragmentation, where a particle splits into two blocs at each 
step (see e.g. [9], [8]). By using representation (13.6|) . if we assume further 
that p{ds\) = p(si)dsi is absolutely continuous, so is 7r(dx) = ir(x)dx and 
we have 

vr( x ) = e -^( /3 ( e ^)l [0]log2] ( x ) + p(l- e -^)l (log2i+oo) ( x )), (4.1) 

for x G [0, +oo) and 

/3(a) = a(p(a)l [1/2il] (a) +p(l - a)l [0il/2) (a)) , a G [0,1]. 

In particular, the regularity properties of /?(•) are readily obtained from 
the local smoothness of p(») and its behaviour near 1/2. For instance, if 
p(a + 1/2) = 0(a K ~ 1 ) near the origin, for some k > 0, then 
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4.2 Concerning Theorem 13.11 



Optimal rate of convergence. First, Theorem 13.11 readily extends to 
error measurements of the form E [|£ £ (<?) — £(g)| p ] with 1 < p < 2. 
The rate becomes £ -/W2(m+i) i n (J37Q an d a P £ -p [ n un der the less 
stringent condition /i < K/2p. 

Generally speaking, we obtain in (|3,ip the (normalized) rate e' i / 2 (/ i + 1 ) 7 
for any [x < k. Intuitively, we have a number of observations that should 
be of order e _1 , so the expected rate would rather be e 1 / 2 . Why cannot we 
obtain the rate e 1 / 2 or simply e K / 2 ( K+1 )? The proof in Section [5.11 shows 
that we loose quite much information when applying Sgibnev's result (see 
Proposition 16,11 in Appendix) on the kew renewal theorem for a random 
walk with step distribution 7r(») in the limit log(l/e) — ► +00. 

Proposition 16.11 ensures that if 7r(») has exponential moments up to 
order k, then we can guarantee in the renewal theorem the rate o(e^) 
for any fi < k with some uniformity in the test function, a crucial point 
for the subsequent statistical applications. It is presumably possible to 
improve this rate to 0{e K ) by accomodating Ney's result [15] . However, 
a careful glance at the proof of Theorem 13.11 shows that we would then 
loose an extra logarithmic term when replacing e^/ 2 (^ +1 ) by £ k /( 2k + 1 ). 
More generally, exhibiting exact rates of convergence in Theorem 13.11 
remains a delicate issue: the kew renewal theorem is sensitive to a 
modification of the distribution outside a neighbourhood of +00, see e.g. 
Asmussen [2], p. 196. 

Uniformity in vr(»). A slightly annoying fact is that convergence (13. ID 
is not uniform in vr(»), which can become a methodological issue for 
the statistical applications of the subsequent Theorems 13.21 and 13.51 in 
particular if min-max results are sought. An inspection of the proof in 
Section [5. II shows that we loose information about the uniformity in 7r(») 
when applying Proposition 16.11 again. A glance at the proof of Sgibnev's 
result suggest that uniform results in 7r(») could presumably be obtained 
over classes of 7r(») defined in terms of appropriate bounds on their Stone 
decomposition [17] . 

The non-conservative case. If Assumption B is dropped, we define 
P- = inf {p > 0, f sl S ?K^ S ) < +°°} an d make the so-called 
Malthusian hypothesis: there exists a (unique) solution p* > p_ to the 
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equation 

oo 

Js i=i 
The empirical measure now becomes 

The choice of the weights is motivated by the fact that the process 
(S| u |=ri£w > n — O) is a positive martingale. We denote by Aioo its ter- 
minal value. Note that under Assumption B, we always have p* = 1 and 
TWoo = 1- Bertoin and Martinez [7j prove the convergence of £e P (g) to 

c(^) 7 a Jsi 

in probability, as e — > 0, with now c(z/) := — J* 5i ]CSi s f l°g s « ^(ds). In 
this setting, Theorem 13.11 becomes 

Corollary 4.1. Grant Assumptions A, C and the Malthusian hypothesis. 
Let < k < oo and m > 0. For every < fi < k, we have 

sup E fo) - ^(s)) 2 ] = o(£^ +p *)). 

geC(m) 

4.3 Concerning Theorems 13.21 and 13.51 
The parametric case. We obtain the rate 

( £ /V(M+l)-p/(2"2+2) 5 forall/ , <Kl 

which can be made arbitrary close to the lower bound e 1 / 2 by assuming 
Ki and K2 large enough. The factor + 1) comes from Theorem 13.11 
whereas the factor K2/( 2^2+1) arises when using the technical assumption 
7r 6 1Z(k2)- We do not know how to improve it. 

Also, the condition ue -3 = 0(1) is fairly restrictive, and can be 
readily improved by assuming that K2 is large. Indeed, if K2 > 1, which 
only amounts to require that 7r(») is bounded near the origin, a glance 
at the error term (|5.10p in the proof Section 15.21 shows that the condi- 
tion drops to oe~ 2 = 0(1). In the limit K2 — > 00, we obain ae~ 3 ^ 2 = 0(1). 
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The non-parametric case. The situation is a bit different than in the 
parametric case: we obtain now the rate 

( £ M/(M+D) s /(2 S +3) 5 forall/U<Kl 

for the estimation of /3(a) for any a G (0,1). In the limit K\ — > +00 it 
becomes e s /( 2s+3 \ which can be related to more classical models: con- 
sider the apparently different problem of recovering a function /?(•) in the 
integral white noise model 

dY a = Kj3{a)da + e 1/2 dW a , a G [0, 1], (4.2) 

from the observation of (Y a , a G [0, 1]). Here, (W a , a G [0, 1]) is a standard 
Brownian motion and K(3(a) := (3(u)du is the integration operator. 
Model (|4,2p serves as a toy representation for the problem of recovering 
a signal in white noise at level e 1 / 2 , when the observation is obtained 
from the action of a smoothing linear operator with unbounded inverse 
(here K). The difficulty of the problem is quantified by the degree of ill— 
posedness of the operator (equal to v for z^-fold integration; here v = 1). 
The well-known optimal rate (see e.g. |12j ) of pointwise recovery for a 
function (3 G S(s) is 

e s/(2s+2u+l) _ £ s/(2s+3) 

The factor 1v is a futher penalty in the rate of recovery quantifying the 
smoothing action of K. The same phenomenon seems to occur in the 
setting of fragmentation chain. Put a := here for simplicity. For a test 
function g(»), we can form the observation 

£ e (g) « £(g) = C g{u)K(3{u)du 

up to an error of (near)-order e 1 / 2 . If we discard the pre- factor mi (71") 
(which can be estimated at a fast rate when K2 is large) we obtain the 
same kind of statistics in Model (|4.2p by considering 

g{a)dY a = f g(u)Kf3{u)du + e 1/2 AA(o), 
Jo 

where Af(g) is centred Gaussian with variance f g(a) 2 da. Note that 
the order of the variance in the noise term N{g) is consistent with 
the improvement obtained in Proposition 15.21 in the proof Section 15.21 
if g G C be (m), we have /J g(afda < b e . 

This suggests the (near)-optimality of the result in the sense of Defi- 
nition (3]3] but a complete proof lies beyond the scope of the paper. 
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4.4 Other statistical issues 

Observation of the whole path of X. Suppose we observe contin- 
uously in time the sample path of X up to some fixed time T > 0. 
Asymptotics are taken as T — > oo. Equivalently, we observe all triples 
(£u) a u-, Cu) f° r every u in the random set 

U{T) := {ueli, a u < T) 

with the restriction that Q u is set to T — a u when a u + Q u > T. We denote 
by U{T_) the subset of U(T) such that a u + Q u < T. In this setting, 
statistical inference about the the self-similar parameter a > and the 
dislocation measure v(ds) is relatively straightforward: 

Estimation of a: conditional on (£ u ,u G U(T-)) the sequence of ran- 
dom variables (<^ u , u G U(T)^j are independent and follow exponential dis- 
tributions with parameters Conditional on CardZY(T) = n and since 
the law of the (£, u ,u G U) does not depend on a we are in the setting 
of estimating a one-dimensional parameter from n independent random 
variables with explicit likelihood ratio 

n 

«-n^ ex p(^), (4.3) 

i=i 

where the Ui range through U(T). The main difficulty remains that the 
law of CardW(r) usually depends on a. 

Estimation of for u— G W(T_), when the fragment of size £ u 
splits, conditional on £ u = x, it gives rize to the observation of the 
rescaled size of its offsprings G N) which is a realization of the 

law u(ds). As a consequence, conditional on CardZ//(T) = n, we observe 
a sequence of n independent and identically distributed random variables 
with law v(ds). We are back to the classical problem of estimating a 
probability distribution from an n-sample. 

More about estimating a. We cannot estimate the index of self- 
similarity a from the data X £ . However, if we add the possibility to "tag" 
a point at random on the initial fragment and if we can observe the 
random time T £ when the tagged fragment becomes smaller then e, then 
identifying a from the sole observation of T £ is possible. 

7 in physical terms, we must be able to identify the mass or length of the fragment. 
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In the case a > 0, if x(t) denotes the size of the tagged fragment at 
time t, then 

T £ = {t>0, X (t)<e}. 

Applying Proposition 3 of [6], the distribution of e a T £ under Pi is tight 
as £ — ► 0. Therefore, the rate — 1/loge is achievable for estimating a and 
it is attained by the estimator log(T E )/log£. More precise results about 
limit laws can be obtained from [6]. 

5 Proofs 

We will repeatedly use the convenient notation a e < b e if < a £ < cb e 
for some constant c > which may depend on 7r(») and m only, any other 
dependence on other ancillary quantities being obvious from the context. 
A function g £ C(m) is tacitly defined on the whole real line by setting 
g (a) = for a £ [0, 1]. 

5.1 Proof of Theorem I3J1 

Step 1: A preliminary decomposition. We first use the fact that 
for r] > e, during the fragmentation process, the unobserved state X v 
necessarily anticipates the state X e . The choice r\ = 77(e) will follow later. 
This yields the following representation: 

where, for each label v £ U„ and conditional on X„, a new indepen- 
dent fragmentation chain , w G 14) is started thanks to the branching 
property, see Section 12.11 Define now 

X v (v) := l{t v _> V £ v < v }€v 

and 

We obtain the decomposition of S e (g) — £{g) as a sum of a centred and a 
bias term: 

£ e (g)-£(g) = M etV (g) + B £ , v , 
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with 

Me, v (9) :=J2Xr,{v)[Y e (v 1 g)-E[Y e (v i g)\X v (v)]) 
veu 

and 

B £tV (g) :=Y,K(v){E[Y £ (v,g)\\ v (v)] -8(g)), 

where we used (|2.1j) in order to incoporate the limit term £ (g) within 
the sum in v. 

Step 2: The term M e ^(g). Conditional on the sigma-field generated 
by (l^ u _ >r] y^ u ,u— G U), the variables (Y £ (v,g),v G U) are independent. 
Therefore 

E[M £ , n (g) 2 ] < ^2E[X v (v) 2 E[Y e (v,g) 2 \X v (v)]], (5.1) 

thus we first need to control the conditional variance of Y £ (v,g) 2 given 
A„(u) = u, for < u < rj, since P-almost surely, X„(v) < n. Moreover, we 
have Y £ (v,g) = on {A r; (u) < e}, hence we may assume e < u <rj. 

To this end, we will use the following representation property: 

Lemma 5.1. Let /(.) : [0, +oo) -> [0,+oo). Then 

E[ E =E*[/to))] s (5.2) 

where x(t) = exp ( — C(*)) an ^ (C(*)>* — 0) * s a subordinator with Levy 
measure 7r(») defined on an appropriate probability space (fl*,P*) 3 and 

T?7 :=inf{t >0, CW > - log 77}. 

The proof is given in Appendix l6.1[ In order to remain self-contained, 
we elaborate in particular on the construction of the randomly tagged 
fragment following the presentation of Bertoin [5] . 

We now plan to bound the right-hand side of (|5.1|) thanks to Lemma 
15.11 For < e < u < w, we have 

E [Y £ (v,g) 2 I X v (v) = u] = E [( £ ^(etT 1 ^)) 2 | \(v) = u 

w eU E/u 



<e[ £ eS ,) »(e«- 1 eS ,) ) 2 |A,(t;) = «] 



weu e/u 
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where we used Jensen's inequality combined with (|2,ip . Applying Lemma 
15.11 we derive 

E [Y £ (v,g) 2 | Xr,{v) = u] < E* ^(mTV^ 7 ^) 2 ] . (5.3) 

Let [/(•) denote the renewal function associated with the subordinator 
(C(t),t > 0). By Proposition 2, Ch. Ill in [3], the right-hand side of 
is equal to 

I dU(s) [ g(uE- l e- x - s fn{dx), 

J [o - \og{e/u)) J(- log(e/«)-s,+oo) 

„ oo 

= / dU(s) y2sili s<EU -i eS yg(s i u£~ 1 e'' s ) 2 u{ds) 

y[0,-tog(e/u)) 



< 1 „„„ a 



'c(tt) 



fllloo lo g( u A0> 



where we successively used the representation (|2.4p and the upper bound 
U(s) < s/c(n), see for instance Proposition 1, Ch. I in [3]. Therefore, for 

e < u < t], 

E [Y £ (v,gf | A» = u] < 4^\\9\\lc logfa/e). 

Going back to (|5.ip . since A^(u) 2 < r] X v (v) and using (|2.ip again, we 
readily derive 

E [M £ , v (g) 2 ] < -L^WgWlnlog^/e) < rjlogfa/e). (5.4) 

Step 3: The bias term B £ „(g). Note first that 

K[Y £ (v,g)\\ v (v)] = C 1 %„ [£e(g)], 
P-almost surely, henceforth 

Be.M = J2 X v( v ){& 1 % [ £ M -£(9))- (5-5) 

veu 

Conditioning on the mark of the parent u— = to of u and applying the 
branching property, we get that Eg u [£ e (<7)] can be written as 



E, 



CO 



ueu " ' Js " i=i 
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where the (£ w ,w G U) are the sizes of the marked fragments of a frag- 
mentation chain with same dislocation measure f(»), independent of 
(£ v ,v G U). Set 

„ oo 

Hg(a):= ^2l( t< ,s l g(s l e a )iy(ds), a > 0. 
•^' si i=i *- J 

It follows that [£ e (<7)] is equal to 



E, 



n=0 |oj|=n 
oo 

n=0 [ w |=n 



by self-similarity, with the notation \uj\ = n if u = (uj\, . . . , u n ) G IA. 
Using Lemma 1.4 in [5], we finally obtain 



E 



p=^' 



n=0 



where S n is a random walk with step distribution ir(dx). We plan to apply 
a version of the renewal theorem with explicit rate of convergence as given 
in Sgibnev [16], see Proposition 16.11 in Appendix 16.21 with rate function 
<p(a) := expQua) for some arbitrary \x < k/2 and dominating function 
r(a) := e _K l a L Indeed, for a < 0: 



I — ) J (— a, +00) 



n(dx) 



' (— a,+oo) 

by (|2.4p . Since g{») has support in [0, 1] and it G H(k), 



\H 9 (~a)\ 



< i \g[e 

(— a,+oo) 



l )|vr(dx) 



Therefore |l{ a < }#g(— a)| < r(a) for all a G M. 

Since k > 2/i, Assumption F of Proposition UTTl is readily checked. Let 
A > (depending on k, m and 7r(») only) such that, if log(£y/e) > A, 
then 

i r+oo 

C 1 % [£ e ( 9 )] - / tf fl (a)da 



E* [5 



1 ./o 



(5.6) 
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We next note that 

^ r+oo 

Introducing the family of events { log(£y/e) > ^4} in the sum (|5.5p . we 
obtain the following decomposition: 

B^g) 2 <I + II, 

with 

J -= E^{io g( w £ )>A}^ lE ^ U 9 )]-£{ 9 ))\ 

and 

v£U v 

By (15. 6p . we have 

Integrating w.r.t. P and applying Lemma 15. II and in the same way as in 
Step 2, we have 

E[J] <e 2 ^E* [ e 2 ^)] 

= e 2 » f dU{s) f e 2 ^ s+x \{dx) 

J[0,-\ogri) J (- log r;-s, +00) 

<e 2 » [ e 2 ^dU{s)<{er 1 - 1 ) 2t "\og{l/r 1 ) 

J[0-\o gV ) 

for small enough e and where we used tt £ II(k) with 2/i < k. For the 
term II, we first notice that by (12. ip and self-similarity, 

% [ E £"] = F ^ _almost surely, 

hence 

fete)] - «?(5)) 2 < 4|| 5 |&, P & -almost surely. 
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In the same way as for the term /, we derive 

E < E [ ^ 1 {-iog^>-A+io g (i/ £ ) 

V& Ur, 

= P* [((T v )>-A + log(l/e)} 



< / dU{s) / ir{dx) 

<^log(l/r?) 

for small enough e. Putting all the estimates together, we conclude 

E [B £ , v (9) 2 ] < + N" 1 ) 2 ") log(V^). (5.7) 

Step 4: Proof of (|3.ip . Putting the estimates (|5.4p and (|5.7p . we have 

E [(£ £ (g) - £(g)) 2 ] < E [M £ Jgf] + E [B £yV (g) 2 } 

< r)\og{r)/e) + (er ? " 1 ) 2/1 log(l/r?). 

The choice 77(e) := £ 2 /V(2m+i) yields the rate £ W(2m+i) log(l/e). Since 
/x < re/2 is arbitrary, the conclusion follows. 

Step 5: Proof of (|3.2p . We plan to use the following decomposition: 

SeM-Seig) =1 + 11, 

with 

1 := 53 ( 1 {?i < L ) > £ ,el <T) <£} ~ 1 ««->s,€„<e})^T ) 5 

and ^ 

where we have set 2v := (i Lw., 1 • Clearly, 



- 1 {«i CT) > £ ,? u -< £ } + 1 {d" ) <e,{«>e} 
+ 1 {^-> £ ,?i < L ) < £ } + 1 Ku<e,d a) >6}- 



Let <5 > cr/e and w = u or u— . Since \U W \ < 1 for every u>, we readily 
check that 

>e,&<e} C {{l-5)e<^<e} 
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and 

{&,>£, ^ <e}c{e<^<(l + 5)e}. 
It follows that \I\ < III + IV, with 

and 

iv : = £ i{ (1 - <5 ) £ < € «<(i +< y) e }|ei CT) 5 (e4 CT) /^) I- 

By picking S is small enough, we may (and will) assume that £i < £ u . By 
(|2.ip . conditioning on the mark of the parent u— = u of u and applying 
the branching property, E [-f^/ 2 ] is less than 

„ oo 

E [^2 1 {Ci-5)e<Z v <s(i+6)}Cv / ^2s i g(e~ 1 (^ v s i + oU v )) 2 v(ds) 
veu Jsl i=i 

= E [E 1 {(l-'5)E<^<£(l+5)}^Gi(^ a; )J , 
with 

„ oo 

Gi(a):= / Va i E[0(e- 1 (aa i + (rE7)) 2 M ds ) 
and U distributed as the U u . Likewise, 

e[jf 2 ] <i[^i {(1 _ 5)£ <^< £(1+l5)} ^G 2 (e u ) , 

with 

G 2 (a) :=E[g(e- 1 (a + aU)) 2 }. 

For i = 1,2, the crude bound |Gj(a)| < WgW^ and the genealogical rep- 
resentation argument used in Step 3 enables to bound either K[III 2 ] or 
K[IV 2 } by 

oo 

ML E r [ - + ^) < ^ - lo g(Ve) < - log(l " 5)] 

n=0 

where S n is a random walk with step distribution 7r(»). We proceed as in 
Step 3 and apply Proposition 16.11 The above term converges to 

mi(7r) _1 log (yZ^) ~ 6 



23 



uniformly in 6, as soon as 5 is bounded, at rate for any < fx < k, 
and thus is of order 5 + . 

We now turn to the term II. We have II := V + VI + VII, with 
V:= 52S»(gffl/e)-g(Me) 



ueu e 



From (7 € C'(m), ()2.ip and Jensen's inequality we derive 



e[^ 2 ] < Ib'IlL^" 2 . 



From |J7 U | < 1 and the inclusion {£u > ^e} C > i £ — er} we derive 



< — 

~ e 2 ' 



where we used that i e = 7o£ with < 70 < 1. Likewise, the inclusion 
{£u^ < t £ } C {£ u < t £ + a} and Lemma [5TT1 yield 

E [F// 2 ] < IbllL P* [ " log x(T £ ) > ~ log(t e + a)] < e» log(l/e) 

for any ^ < k, in the same line as for the bound of the right-hand side of 
B in Step 2. 



Putting all the estimates together with, for instance, 5 := a/2e we 
obtain (|3.ip . The proof of Theorem 13.11 is complete. 



5.2 Proof of Theorem EOl 

Preliminaries. Let < 6 £ — » as e — > 0. For m > 0, define the class 

C 6e (m) := {5 G C(m), |supp(g(.))| < m6 e }. 
We have the following extension of Theorem 13.11 

Proposition 5.2. Grant Assumptions A, B and C. In the same setting 
as Theorem \3.1\ if in addition, we assume k > 1, then, for every [i < n 

sup E [(£ £ (g) - £(g)) 2 ] = o(e^ +1 h £ ). 

g&C be (m) 
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Proof. We revisit carefully Steps 2 to 4 of the proof of Theorem 13.11 under 
the additional Assumption C, and we write g(») = g £ (») to emphasize that 
g(») may now depend on the asymptotics. 



In Step 2, the right-hand side of (|5.3p is now bounded by the following 
chain of arguments: 

- log(e/M) r+oo 2 

dll(s) / g £ (ue~ 1 e~ x ~ s ) %(x)dx 

J — log(e/u)— s 

- log(e/u) ,-eu- 1 e s 

dU(s) / g £ (xue~ 1 e~ s ) (3(x)dx 



<||/3||oo« -1 e 



e s dU(s) [ g £ (x) 2 dx <b £ log{u/e) 
M) Jo 



'[0-log(e/«)) 

where we used that | supp(g e )| < b £ and U(s) < s/c(ir) again. Note that 
ll/3||oo ^5 1 since k\ > 1 and K2 > 1. Therefore 

K[Y £ (v,g) 2 \X r] (v) = u] <b £ , 

Hence 

E [M £i?? ( 5 ) 2 ] < 6 e 7/. 

In Step 3, we replace g(») by g £ (») in £ e (g) and £ (g). We have, for any 
< fl < K, 

1 f 1 \g £ (a)\ [+™ 

' " 7r(x)dxda 



\£(9e)\ < / 
c(vr) 7 



a Jlog(l/a) 



< l \ 9e {a)\aP- x da<b e 
Jo 



for \i > 1 and since 7r G with k > 1. By Cauchy-Schwarz, for a < 0, 

/ /' +00 o \ 1/2 / /'+ 00 \ 1/2 

\H ge (-a)\<[J g £ {e- x - a )\(x)dx) [J n(x)dx) 
<e^(j\ £ (y) 2 (3(ye a )d y y /2 e^ 

< tl/2 o(1+k)/2 

using again that ||/3||oo ^5 1- Therefore ^~ 1 E^ ii [£ e (<7 E )l < 6^ 2 , and we can 
apply Proposition 16.11 with rate function ip(a) = exp(//tt) and dominating 
function r(a) := e -(i+«)M/2. 
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The terms / and II are bounded in the same way. We obtain 

for \x < min{K/2, (1 + k)/2} = re/2, uniformly over the class C& e (m). The 
trade-off between M e ^ v (g e ) and B e ^(g e ) yields the result. □ 

Completion of proof of Theorem 13.21 We first write 

1 f 1 

£(9ie) ~ 1 = — r~: / (/ 7 e( a ) - l)P(a)da. 

We have 

(f Js (a)-l)P(a)da <2 / ^ ^ n(x)dx < ^ 

'l-7e JO 

since tt £ 1Z(n2) and — log(l — x) < x for small enough x > 0. We derive 

l^j-mi^r 1 !^ 2 - ( 5 - 8 ) 

Next, by construction, 7g5 7e 6 C 7e (l), hence for any < /x < K\, Propo- 
sition 15.21 entails 



E[\£ £ ( g ^-£(g,j\]<^e^^. (5.9) 

Moreover, since 7 e g 7e G C'(l), we have, by (|3.2h in Theorem 13.11 

E [\£ £ {g lE ) - £eA^)\] < Is 2 ™' 1 - (5-10) 

The specification 7 e = £^/(^+ 1 X 2k2 + 1 ) yields the correct rate for (j5.8l) and 
(j5.9|) . The assumption that ere -3 is bounded ensures that the right-hand 
side in (15.101) is asymptotically negligible. The conclusion readily follows 
for fhi >e . 

We now turn to higher moment estimators. Thanks to the proof for 
the case k = 1, it suffices to show that 

mi(7r)£ e (<7 7e ) -> / log(l/a) fc /?(a)(ia 

JO 

in probability with the correct rate as e — ► 0. Note first that 

5 7£ (a) = -alog(l/a) fc 4(l - a) + fc/ 7(t (l - a) logfl/o)*- 1 
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is a sum of a function in C 7e (ci) and a function in £'(02), for some positive 
ci and C2, hence we may apply Proposition 15.21 and Theorem 13.11 to each 
term respectively. Next 

c(ir)£(g le )=- ! (/ 7£ (l-a)log(l/a) fc )' f (3(u)duda 
Jo Jo 

= f / 7e (l-a)log(l/a) fc /3(a)da, 
Jo 

since k > 1 and 7r G II(ki) so that the boundary terms vanish when 
integrating by part. We conclude by noticing that by Holder's inequality, 
for any r > 0, we have 

| f(l-/ 7E (l-«))log(l/a)%)da| 
Jo 

/+oo r+oo 
Tr(x)dx) 1 ~ T ( / x k/r ir(x)dx) T < ^- T \ 
- log 7 e Jo 

This term also has the right order since the choice of r is free and k\ > K2 
by assumption. The proof of Theorem 13.21 is complete. 

5.3 Proof of Theorem f3T4l 

With no loss of generality, we consider the homogeneous case with a = 0. 
We may also assume that a = 0, since adding experimental noise to the 
observation of the fragments only increases the error bounds. 

Step 1: An augmented experiment. In the binary case, the dislocation 
measure v(ds) is equivalently mapped by a probability measure on [1/2, 1] 
with density function a p(a), see (13.6h . 

We prove a lower bound in the augmented experiment where one can 
observe all the sizes X £ of the fragments until they become smaller than 
e, namely 

Clearly, taking the infimum over all estimators based on X £ instead of 
X £ = X e fi only reduces the lower bound. 

For every u E U £ , we have > e. By the conservative Assumption 
B, there are at most e~ l such £„_ so CardZ^ e < 2e~ l . For every node 
u £ U, the fragmentation process gives rize to two offsprings with size 
^ U U and £ n (l — U), where U is a random variable independent of £ M with 
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density function p(»). Therefore, the process of the sizes of the fragments 
in the enlarged experiment can be realized by less than 

2e-i(i + § + . . . + --y < [As' 1 ] + 1 =: n{e) 

independent realizations of the law p(»), where k(e) := log 2 (2/e), assumed 
to be integer with no loss of generality. 

In turn, Theorem 13.41 reduces to proving that e 1//2 is a lower rate 
of convergence for estimating m k (ir) based on the observation of a 
n(e)-sample of the law /?(•). The one-to-one correspondence between p{») 
and 7r(») is given in (14, lj) . 



Step 2: Construction of ir E . We write /?7,-(») to emphasize the depen- 
dence upon 7r(»). Let 

ip k (a) := a(log(l/a) fe + (1 - a) log (1/(1 - a))", a G [1/2, 1]. 
From (14. ip . we have 

m k (iro) = / fk(a)pn (a)da. 
J 1/2 

Let < r < 1. Pick a function ipk(») : [1/2, 1] — > M such that 

ll^fclU < rinf P7ro(a), / ipk(a)da = 0, r(k) := / ip k (a)if>k(a)da ^ 0, 

a choice which is obviously possible thanks to Assumption D. For e > 0, 
define 

P7r £ (a) :=fr (a)+£ I/2 4(a), aG [1/2,1]. 

(And so (|4.1|) defines vr £ (») unambiguously.) By construction, p^i*) is a 
density function on [1/2, 1] and has a corresponding binary fragmentation 
with Levy measure given by 7r e (»). Moreover, 

m k (7T £ ) = m k (TT ) + r(k)e 1/2 . 



Step 3: A two-point lower bound. The following chain of arguments 
is fairly classical. We denote by ¥ n the law of the independent random 
variables (Ui,i = 1, . . . , n(e)) with common density /%(•) that we use to 
realize the augmented experiment. 
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Let F £ be an arbitrary estimator based on X e . Put c := \r(k)\/2. We 
have 

max F n [e~ 1/2 \F e - m fc (vr)| > c] 

7re{7ro,7r e } 



> 



1 (p^ [e- 1/2 \F £ - m k (ir )\ > c] + F We [e- 1/2 \F £ - m k {^ £ )\ > c]) 



> i E 

— 2 w o 



l{ £ -i/2|ir e _ mfe (7ro )|> c} + l {s-V*\F e -m k {-Ks)\>c} 



§11^0 -^llTV, 



where ||»||tv denotes the total variation distance between probability 
measures. By the triangle inequality, we have 

£- 1/2 (|F e - m k (ir )\ + \F £ - m k {^ £ )\) > \r{k)\ = 2c, 

so one of the two indicators within the expectation above must be equal 
to one with full P„- -probability. Therefore 

max P,,- [e~ 1/2 \F £ - m k (7r)\ >c]> i(l - 

7re{7ro,7r e } 

and Theorem 13.41 is proved if 

limsupHP^o — PttJItv < 1- 



\ -^ e \\TV), 



(5.11) 



By Pinsker's inequality 



V2 



no 



7T: \TV < ^ (Pttq 



log--— 

dF^ 



1/2 



and 



E 



-(i 



log 



dF W£ 



n(e) ^ 



i=l 



log 



Pn s ( U i. 



n(e) 



= - ^E, [log (1 +e 1/2 MU i )p m {U i )~ 1 ) -^MU^P^Mr 1 

i=l 

where we used 

v-l 



^ k {a)da = 0. 



1/2 



We also have £ 1 ^ 2 \'(p k (Ui)p 7TQ (Ui) 1 \ < re 1//2 hence for small enough r, 
\-\og(l + e l l 2 MUi)p,MY 1 ) +e 1/2 MUi)PnMr 1 \ <r 2 e. 

Therefore UPttq— PttJItv < ^ L TE 1 / 2 n{e) 1 l 2 < 1 by picking r small enough, 
and (|5,lip follows. The proof of Theorem 13.41 is complete. 
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5.4 Proof of Theorem 1331 



We plan to use the following decomposition 

0(a) - (3(a) = mi, e £ e>a ( -. <^ e , a (.)) - (3(a) =1 + 11 + 111 + IV, 
with 



I : = m 1]£ [£ £j)J ( - . </4, a (»)) - ^( - • ^ e ,aW 
II := mi )£ - • ^, a (»)) - £ ( - • <p' 7e , a (») 

III := (mi )E - mi(-Tr)) £ ( - . V^.aW) ' 

IV :=m 1 (^)£(- ,^ a (.)) - (3(a). 

Considering I and II, the term mi j£ is bounded in probability by 
Theorem 13.21 By (j3.2|) in Theorem 13.11 together with the fact that 
G C'dl^'Hoo), we have 

E [|f e ( - .<,„(.)) - SsA " ^ % 3 ^' 1 - 

By construction, we have j^»ip' a (») G C 7e (||</?||oo)> therefore, by Propo- 
sition 15.21 

E[(f e (-.^») -£(-.^ a (.))) 2 ] < 7 -3 £ M/(M+l). (512) 

Considering III, using (|5.8p . we have — • V^aW) | ^ By Theorem 
13.21 we conclude that III 2 has order 

7 -2 e 2^ K2 /(^+l)(2 K2 +l) ( 5 _ 13 ) 

in probability. For IV, we first notice that 

m 1 (7r)£( -,<p' (•)) = [ ip leia (u)(3(u)du, 

Jo 

hence ^ 

IV 2 = ([ ^ a (u)(3(u)du - (3(a)) 2 < 7e 2s (5.14) 



by a Taylor expansion and using that the terms up to order s — 1 vanish 
by the cancellation property (|3.T|) of (p(») since s < N. 

Putting together (|5.12p . (|5.13p and (|5.14p . we see that the specification 
7 e = £w(m+ 1 )( 2s + 3 ) yields the correct rate for II and IV, as well as for III 
as soon as «2 > s + 1. Finally, the term I proves asymptotically negligible 
in the same way as (|5.10p thanks to the assumption that ae^ 3 is bounded. 
The proof of Theorem 13.51 is complete. 
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6 Appendix 



6.1 Proof of Lemma 15.11 

First, we enrich the structure of the genealogical tree representation of 
Section[2]by adding a random mark M : U — ► [0, 1] x [0, +oo) x [0, +oo) on 
the tree together with a random branctjl of U and define unambiguously 
the law F* of (M, 0) by setting 

E* [$ n (M, 0)] := E [ ®n(M, u)£«] , n > 1, 

|u|=n 

where <E> n is a bounded functional which depends on the mark M and the 
branch up to the n-th generation only. If n is the node of the random 
branch at the n-th generation, we set Xn '■= £a n and x(t) f° r t ne si ze °f 
the tagged particle at time t: 

Y u\ ._ / Xn if Oft, < t < ap n + C/3„, 
* W ' I if i^lim^ooo^, 

where a/3 n and C,p n denote respectively the birth-time and lifetime of the 
particle labeled by the tagged node n . We then have 

oo 

ie [ £ = e [£ E V^W 6,7 ^ 

u6W e n=0|v|=ri 
oo 

n=0 L J 

By Proposition 1.6 in [5], — log^n is a random walk under P* with step 
distribution ir(dx). In particular, the last term above is independent of a. 

Consider now a homogeneous fragmentation process with same dis- 
location measure v{») living on the same (possibly enlarged) proba- 
bility space for simplicity. Applying the same construction above, we 
obtain a process (x (t),t > 0) that can be expressed in the form 
X^(t) = exp (— where {((°\t),t > 0) is a compound Poisson pro- 
cess with jump intensity 1 and jump distribution vr(»). By construction, 

we have x^i^i ) = Xn^ on the event {Xn-i — V: Xn^ < v} therefore 

oo 

E* [/(x (0) (Tf))] = J> [\^ v , x ^ <v} f(x { n- 

n=0 

8 A branch is an infinite sequence of positive integers which we can think of as the 
line of ancestors of some leaf of the tree. 
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The conclusion follows by identifying the right-hand sides of the last two 
equalities. 



6.2 Rates of convergence in the key renewal theorem 



We give a version of Sgibnev's result [16J on uniform rates of convergence 
in the key renewal theorem, which is proved in a more general setting. 

Let F{dx) be a probability distribution with positive mean m and 

renewal function H = E^o-F™*' with F °* := 6 o, F 1 * ■= F and 
F (n+i)* := F*F n *, n y Q 

We assume that F is spread-out, that is, for some n > 1, F* n has a non- 
zero absolutely continuous component. Stone [T7] shows that then there 
exists a decomposition H = Hi + H2, where H2 is finite measure and H\ 
is absolutely continuous with bounded continuous density function hi*) 
such that lim x ^ +00 h(x) = m~ l and linxj^-oo h(x) = 0. 

We denote by T(F) the u-finite measure with density function 



F(du)l [0t+oo) (x) - / F(du)l(_ OOi0) (s). 

[x, +00) J (—00,1] 

and T 2 (F) := T(T(F)). Let ip(.) : R -> [0, +00) be a submultiplicative 
function, i.e. such that y(0) = 1, ip(x + y) < cp(x)(p(y). Then we have 
(see for instance [10], Section 6) 

log(/?(x) log99(x) 
— 00 < ri := hm < hm =: r2 < +00, 

x— >— 00 x X— >+oo x 

Assumption E. VFe have T\ < < T2 and there exists r(») : R — > R on 

integrable function and such that the following conditions are fulfilled: 

sup \r(x)\(p(x) < +00 , lim r(x)<p(x) =0, 

x |x|^oo 



lim f(x) / r{u)du = lim ip{x) I r(u)du = 0, 

and 

<p(x)T 2 (F)(dx) < 00. 



Sgibnev's result takes the form: 
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Proposition 6.1. (Theorem 5.1 in U6\/ ). Grant Assumption E. Then 



lim tp(t) sup g-kH(t) — m 1 / g(x)dx 

Q,|g(3:)|<|r(x)| " JM. 



0. 



g,\g(x)\<\r(x)\ 

We call ip{») a rate function and r(») a dominating function. 
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